Automatic Wikipedia Link Generation Based On Interlanguage Links
نویسنده
چکیده
This paper presents a new way to increase interconnectivity in small Wikipedias (fewer than a 100, 000 articles), by automatically linking articles based on interlanguage links. Many small Wikipedias have many articles with very few links, this is mainly due to the short article length. This makes it difficult to navigate between the articles. In many cases the article does exist for a small Wikipedia, however the article is just missing a link. Due to the fact that Wikipedias are translated in to many languages, it allows us to generate new links for small Wikipedias using the links from a large Wikipedia (more than a 100, 000 articles).
منابع مشابه
Enriching Wikipedia's Intra-language Links by their Cross-language Transfer
Although hyperlinks enhance the utility of Wikipedia, embedding them in articles imposes a burden on contributors. To alleviate this burden as well as enrich hyperlinks in Wikipedia articles, we propose a method for transferring intra-language links between different-language articles linked via an interlanguage link. The method avoids anchor selection and disambiguation problems by which usual...
متن کاملAutomatic generation of in-text hyperlinks in web publishing
We present a method for automatic generation of in-text explanatory hyperlinks for use in web publishing. A system using this method is currently in production as part of a service for enriching plaintext content. We recognize the importance of link anchors in practical use of such systems, therefore the method is centered around link anchor selection and uses semantic similarity only to resolv...
متن کاملWikiV3 results for OAEI 2017
WikiV3 is the successor of WikiMatch (participated in OAEI 2012 and 2013) which explores Wikipedia as one external knowledgebase for ontology matching. The results show that the matcher is slightly better than matchers based on string equality and can get higher recall values. Moreover due to the construction of the system it is able to compute mappings in a multilingual setup. 1 Presentation o...
متن کاملCross-lingual Semantic Relatedness Using Encyclopedic Knowledge
In this paper, we address the task of crosslingual semantic relatedness. We introduce a method that relies on the information extracted from Wikipedia, by exploiting the interlanguage links available between Wikipedia versions in multiple languages. Through experiments performed on several language pairs, we show that the method performs well, with a performance comparable to monolingual measur...
متن کاملEvaluation of ILP-Based Approaches for Partitioning into Colorful Components
The NP-hard Colorful Components problem is a graph partitioning problem on vertex-colored graphs. We identify a new application of Colorful Components in the correction of Wikipedia interlanguage links, and describe and compare three exact and two heuristic approaches. In particular, we devise two ILP formulations, one based on Hitting Set and one based on Clique Partition. Furthermore, we use ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1701.01858 شماره
صفحات -
تاریخ انتشار 2017